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Abstract 

Quantum discord was proposed as an information the- 
oretic measure of the "quantumness" of correlations. I 
show that discord determines the difference between the 
efficiency of quantum and classical Maxwell's demons - 
that is, entities that can or cannot measure nonlocal ob- 
servables or carry out conditional quantum operations ~ 
in extracting work from collections of correlated quantum 
systems. 



I. DEMONS 

Maxwell's demon^ was introduced to explore the role 
of information and, more generally, to investigate the 
place of "intelligent observers" in physics. In modern dis- 
cussions of the subject^ "intelligence" is often regarded 
as predicated upon or even synonymous with the in- 
formation processing ability - with computing. Thus, 
Maxwell's demon is frequently modeled by a Turing ma- 
chine - a classical computer - endowed with the ability 
to measure and act depending on the outcome. The role 
of a demon is to implement an appropriate conditional 
dynamics - to react to the state of the system as revealed 
through its correlation with the state of the apparatus. 

It is now known that quantum logic - i.e., logic em- 
ployed by quantum computers - is in some applications 
more powerful than its classical counterpart. It is there- 
fore intriguing to enquire whether a quantum demon - an 
entity that can measure non-local states and implement 
quantum conditional operations - could be more efficient 
than a classical one. I show that quantum demons can 
extract more work than classical demons from correla- 
tions between quantum systems, and that the difference 
is given by the quantum discord, a recently introduced'^"^ 
measure of the "quantumness" of correlations. 

Maxwell's demon sets up a useful conceptual frame- 
work that provides an operational interpretation of dis- 
cord. The role played by the quantum demon - carrying 
out conditional quantum operations on pairs of systems 
- could be also fulfilled the classical device that can out- 
right measure non-local quantum observables. This is 
especially apparent in section IV where we alternate be- 
tween the quantum and classical demon on one hand, and 
Alice and Bob on the other. The real point of employ- 
ing demons is to draw attention to the thermodynamic 



(and information - theoretic) costs of various operations 
and - in a sense - to hold Alice and Bob accountable 
for their thermodynamic expenditures which are usually 
simply ignored. 



II. DISCORD 

Quantum discord'^"^ is the difference between two clas- 
sically identical formulae that measure the information 
content of a pair of quantum systems. Several closely re- 
lated variants can be obtained starting from the original 
definition"^ given in terms of mutual information^. Mu- 
tual information quantifies the strength of correlations 
between, say, the apparatus A and the system S: 



I{S : A) - H{S) + H{A) - H{S,A) 



(1) 



It measures the difference between the missing informa- 
tion about the two objects when they are taken sepa- 
rately, H{S)+H{A), and jointly, H{S,A) (see Fig. 1). In 
the extreme case S and A may be identical - e.g., copies 
of the same book, or a state of the apparatus pointer A af- 
ter a perfect but as yet unread measurement of S. Then 
the joint entropy H{S,A) is equal to H{A) — H{S), 
so I{S : A) — H{A). By contrast, when the two ob- 
jects are not correlated, H{S,A) — H{S) + H{A), and 
I[S ■.A)^Q. 

The other formula for mutual information employs 
classical identity for joint entropy^: 

H{S, A) = H{A) + H{S\A) - H{S) + H{A\S) (2) 

Above H{S\A) is the conditional entropy - e.g., the mea- 
sure of the lack of information about the state of S given 
the state of A. Substituting this in Eq. (1) leads to an 
asymmetric looking formula for mutual information: 

Ja{S : A) = H{S) + H{A) - [H{A) + H{S\A)] (3) 

We have refrained from carrying out the obvious can- 
cellation above that would have yielded Ja{S ■ A) — 
H{S) — H{S\A) for a reason that will be soon apparent. 
Discord is defined as: 

S{S\A) = I{S : AyjAiS : A) - [H{A)+H{S\A}]-H{S,A) 

(4) 

Classically, discord disappears as a consequence of Eq. 
(2) - information about a collection of classical objects 
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can be acquired one object at a time. In quantum the- 
ory, however, measurements can modify quantum state. 
Thus, in order to properly define conditional entropy one 
must specify how the apparatus is "interrogated" about 
S: After a measurement of the observable with eigen- 
states {l^fe)} observer's own description of the pair is 
the conditional density matrix: 

PSA\Ak) = PS\Ak) ^ \Ak){Ak\ (5) 

Given an outcome \Ah), he will attribute Ps\Ak) = 
{Ak\psA\Ak) /pA{k) to S with the probability p^(fc) = 
Tr{Ak\psA\Ak)- Even for an outsider (who has not yet 
found out the outcome) post-measurement density ma- 
trix p'g^ usually differs from the pre-measurement psA- 
This outsider state of knowledge should be contrasted 
with the viewpoint of the insider who made the mea- 
surement: Insider knows that the apparatus is in the 
state \Ak). Outsider does not, so he obtains his post- 
measurement p'g^ by averaging over the outcomes: 

PsA = J2P^(''^Ps\A,)®\Ak){Ak\ (6) 

k 

Outsider's description of the pair is unaffected by the 
insiders measurements - p'^^ = psA ^ only when the 
measured observable commutes with psA- We shall find 
outsider viewpoint useful because it represents a statis- 
tical ensemble of all possible outcomes. 

In quantum physics one definition denoted by J4(iS : 
A{\A^)]) is the locally accessible mutual information. It 
uses Eq. (3) with the joint entropy given by: 

Hj,{S, A{\A,)}) = [H{A) + H{S\A)]{\A,)} (7) 

where {[A^;)} is the eigenbasis of the to-be- measured ob- 
servable of the apparatus. Another acceptable and com- 
pletely quantum definition of I{S : A) relies on the the 
von Neumann entropy of the density matrix psA describ- 
ing the joint state. Then: 

H{S,A) = -TrpsAhPsA = - lgP5^(0 (8) 

I 

where probabilities psa{1) ai'e the eigenvalues of psA that 
describes the correlated pair. These eigenvalues always 
exist, but in general correspond to entangled quantum 
states IV's^(O) joint Hilbert space of S and A. 

Such states cannot be found out through sequences of lo- 
cal measurements starting with just one subsystem of the 
pair - say, A. This fundamental difference between the 
quantum and the classical realm (where such "piecewise" 
investigation is always possible and need not disturb the 
state of the pair) is responsible for non-zero discord. 

A simple example of this situation is a perfectly entan- 
gled Bell state: 

|V'5^) = (|050^) + |lsU))/\/2 (9a) 



Clearly, psA = \'>PsA){'>i-'SA\ is pure - the pair is in the 
state \tpsA)- Hence, in accord with Eq. (8), H{S,A) = 0. 
On the other hand, pa{s) = Trs(A)PSA = l.A(5)/2, 
where 1 is the unit matrix in the appropriate Hilbert 
space, so that H{A) = H{S) = 1. Consequently, 
/(«S : A) = 2, but the asymmetric mutual information 
is Ja{S : A) = 1. This is because the joint informa- 
tion HA{S,A{\Ak)}) defined with reference to any mea- 
surement on a A, Eq. (5), is a sum of H{A) = 1 and 
H{S\A) = 0. In our example both of these quantities are 
independent of the basis because of the symmetry of Bell 
states. 

Readers are invited to verify that a classical correlation 

in: 

PSA = (|050^)(0^05| + |l5U)(Ul5|)/2 (96) 

results in zero discord, but only when the preferred basis 
= {|0), |1)} is employed. The entangled state of 
Eq. (9a) could be converted into the mixture of Eq. (9b) 
through cinselection of the preferred (pointer) basis'*'*^^^"'^ 
or - and this is why decoherence can be regarded as moni- 
toring by the environment - through a measurement with 
an undisclosed outcome carried out in the same pointer 
basis = {|0), |1)}. 

In general, the ignorance of the outsider cannot de- 
crease (but may increase) as a result of a measurement 
of a known observable (by the insider), as the outsider 
does not know the outcome^^. Hence, 

S{S\A{iA,)}) = Ha{S, A^iA,)}) - H{S, A) > (10) 

Discord disappears only when psA remains unaffected by 
a partial measurement of on the A end of the pair 

- when the information is locally accessible. 

III. DEMONS AND DISCORD 

The relevance of the discord for the performance of 

Maxwell's demon can be now appreciated. Demons are 
insiders. They use the acquired information to extract 
work from their surroundings. The traditional scenario 
starts with an interaction establishing initial correlation 
between the system and the apparatus. The demon then 
reads off the state of A, and uses so acquired information 
about S to extract work by letting S expand throughout 
the available phase (or Hilbert) space of volume (dimen- 
sion) ds while in contact with the thermal reservoir at 
temperature Ti'^'i^"^^. This yields: 

W+ = kB,T{\gds - H{S\A)) (11) 

of work obtained at a price: 

W- = kB,TH{A) (12) 
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Above, ks-i is the Boltzmann constant adapted to deal 
with the entropy expressed in bits and T is the temper- 
ature of the heat bath. The net gain is then: 

W = kB,T{\gds~[H{A) + H{S\A)]) (13a) 

The price W~ is the cost of restoring the apparatus to the 
initial ready-to-measure state. The significance of this 
"cost of erasure" for the second law was pointed out in the 
seminal paper of Szilard^^. Its relevance in the context 
of information processing was elucidated and codified by 
Landauer^^. 

It is now accepted that, because of the cost of erasure, 
neither classical^^"^^ nor quantum^^"^^ demons can vi- 
olate the second law. However, a demon with a supply 
of empty memory (used to store measurement outcomes) 
can extract, on the average, of work per step from 
a thermal reservoir. This strategy works, because, in ef- 
fect, demon is using its empty memory as a zero entropy 
(and, hence, T = 0) reservoir: A memory block of size 
is used up with each new measurement. This is ex- 
pensive (and wasteful) and only fraudulent accounting 
(uncovered by Szilard and Landauer) that ignores ther- 
modynamic cost of empty memory can create appearance 
of a violation of the second law. 

To optimize performance demon should use memory of 
A more efficiently. The obvious strategy here is to com- 
press bits of the data after a sequence of measurements, 
freeing an unused block of A/^ bits. Demon can compress 
data Ak to the size given by K(Ak), their algorithmic 
complexity-^^ . With compression A/j. = Igd^ — K{Ak) 
memory bits per cycle are saved. Moreover, one can show 
that for long sequences of data the approximate equality 
{K{Ak)) ~ H{A) becomes exact, so that the saved up 
memory is on the average Afj, = lgd^ — H{A). By being 
frugal, classical Maxwell's demon can gain, per step, net 
work of^'''^^: 

W = kB,T{lgdsdA - [H{A) + H{S\A)]) (136) 

When S and A arc classically correlated so that Eq. (2) 
applies, this can be written as: 

W = kB,T{lg dsdA - H{S, A)) (13c) 

We note that the efficiency is ultimately determined by 
the information about S and A accessible to the demon, 
and that the same equation would have followed if we 
simply regarded the SA pair as a composite system, and 
the demon used it all up as a fuel. 

The efficiency of demons is then determined by the 
accessible information about the pair SA - the relevant 
joint entropy - and we have already seen in Eqs. (7) and 
(8) that in quantum physics it depends on how the infor- 
mation about the pair is acquired. A classical demon is 
local - it operates one-system-at-a-time on the correlated 
quantum pair SA. In this case the above sketch of the 



"standard operating procedure" applies with one obvious 
caveat: It needs to be completed by the specification of 
the basis demon measures in A. The cost of erasure is still 
given by Eq. (12), also for classical demons extracting 
work from quantum systems^^"^^, although the relevant 
H{A) may increase as a result of decoherence that con- 
verts quantum entanglement into classical data^''. Thus 
classical demons operating on pairs of quantum systems 
gain net work of: 

W^'/kB.T = Ig dsA - [H{A) + H{S\A)]^iA,)} (14) 

The only difference between the classical Eq. (13a) and 
the quantum Eq. (14) is the obvious dependence on the 
basis ll^fe)} demon selects to measure. The expression 
in square brackets is the measure of the remaining (con- 
ditional) ignorance and of the cost of erasure. We shall 
be interested in the {|^fe)} that maximize W'^ . 

A quantum demon can typically extract more work - 
get away with lower costs of erasure - because its mea- 
surement can be carried out in a global basis in the com- 
bined Hilbert space of SA corresponding to observables 
that commute with the initial psA and avoid increase of 
entropy associated with decoherence'*'*^"^^'^'^. The work 
that can be extracted after the apparatus gets reset to 
its ready-to-measure state is: 

W^/kB.T = Ig dsA - H{S, A) (15) 

The other way to arrive at Eq. (15) is to use quan- 
tum demon in its capacity of a universal quantum com- 
puter, which, by definition, can transform any state in 
the Hilbert space into any other state (see Fig. 2 for 
an example of a model demon that operates on pairs of 
qubits). This allows the quantum demon to revcrsibly 
evolve entangled eigenstates of an arbitrary known psA 
into product states of some psA with the same eigenval- 
ues, and, hence, same entropy which can be then manip- 
ulated in a local basis that does not perturb its eigen- 
states, and, hence, as viewed by the outsider, it will not 
suffer any additional increase of entropy. The work ex- 
tracted by the optimal quantum demon is limited simply 
by the basis-independent joint von Neumann entropy of 
the initial psA, Eq. (8). 

The difference between the eflSciency of the quantum 
and classical demons can be now immediately character- 
ized: 

A = AW/kB,T = [H{A) + H{S\A)][iA,)} - H{S,A) 

(16) 

or: 

AW = kB,T5{S\A[\A,)}) (17) 

Equation (17) relating the extra work AW = W^ — W^ 
to quantum discord - to the difference of the accessible 
joint entropy of classical (local) and quantum (global) 
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demons - is the principal result of our paper. It answers 
an interesting "demonic" question while simultaneously 
providing an operational interpretation of discord. 

To gain further insight into implications of the above, 
let us first note that discord is, in general, basis depen- 
dent. Discord disappears iff the density matrix has the 
"post-decoherence" (or "post-measurement") form, Eq. 
(6), already before the measurement. Given the ability 
of classical demons to match quantum performance stan- 
dard in this case, basis that allows for the disap- 
pearance of discord in the presence of non-trivial corre- 
lation can be deemed classical^"^. We note that psA in 
the locally diagonal form, Eq. (6) , may emerge as a con- 
sequence of the coupling of ^ with the environment^^^^. 
The preferred pointer basis is a result of einselection. 

A typical pg^ does not have the form of Eq. (6), how- 
ever. In that case discord does not completely disappear 
for any basis. The least discord: 

S{S\A) = 7mn{\A,)}[H{A) + ff(5|^)]{|A,)} - H{S, A) 

(18) 

corresponds to maximum efficiency of a classical demon. 
Note that to get the right answer we had to minimize 

the sum of the two terms contributing to the joint en- 
tropy, rather than each of them separately. The alterna- 
tive d{S\A) = HiA) +min[\A,)}H{S\A)[iA,)} - H{SA) 
would have followed if the cancellation in Eq. (3) was 
carried out. The difference between them is obvious, and 
S{S\A) > d{S\A). 



IV. DEMONS AND DISCORD 

Discord is not symmetric between the two ends of the 
correlation: In general, 5{S\A) ^ S{A\S). In particular, 
for density matrices that emerge following einselection 
in A S(S\A) will vanish but (5(^|5) may remain finite. 
Such locally accessible correlations are one-way classical. 
They are characterized by a preferred direction - from A 
to iS - in which more information about the joint state 
can be accessed. Thus, when a local demon can choose 
between the two "ends" of the SA pair, it may be more 
efficient than a one-way demon. Indeed, one could define 
polarization: 

vu{S\A) = 5{S\A) - 5{A\S) (19) 

to quantify this directionality. 

One can generalize discord to collections of correlated 
quantum systems. By analogy with the case of a sin- 
gle pair we define it as a difference between the joint 
entropy accessible through a particular sequence of (pos- 
sibly conditional) measurements - that is, the obvious 
generalization of Eq. (7) - and the joint von Neumann 
entropy of the unmeasured density matrix. The least dis- 
cord of such a collection of systems is a minimum over 



all possible sequences of all possible measurements. This 
corresponds to the demon having a choice of the end of 
the pair it can measure first. 

This last situation allows one to address questions 
raised in the recent paper on the work that can be ex- 
tracted by local and global observers from correlated 
pairs of quantum systems^**. The authors show that a 
global observer will be able to extract more work from 
a pair of quantum systems than "Alice and Bob", who 
can carry out local operations and communicate classi- 
cally ("LOCC") with each other, and the difference A 
(in units of fcsj^) is bounded from below: 

A> A = maxA^s{H{A),H{S)) - H{A,S) (20) 

This lower bound is illustrated in Fig. 3 along with our 
result given by S for Werner states. In this case discord 
yields the work deficit, A = S. Indeed, our arguments 
throughout the paper show that S gives the difference in 
efficiencies when only one-way classical communication 
is allowed. Obviously, allowing for a two-way communi- 
cation between Alice and Bob can only help, so j > A is 
an upper bound on the difference of the efficiencies, and 
there arc cases (e.g., Werner states of Fig. 3) where this 
upper bound is saturated. 

This leads to an interesting questions: When does 
two-way communication provide a significant advantage? 
Does a single round of two-way communication always 
suffice, or is it possible that many iterations may help 
even more? The nested density matrix of the form: 

PA,B,C,...,S = '^PA{i)PB,C,...,S\Ai} O (21a) 



PB,C,...,S\A.) = E^'B 0>C,...,5|A.,B«>®l^f I - ^t-- 

(216) 

where every second of the subsystems (i.e., A,C,£, ... is 
on Alice's side, while the complement B,T>,... are on 
Bob's side) settles these questions. When the condi- 
tional density matrices PB,e,...,S\Ai}, Pc,...,S\Ai,Bj), etc., 
are not co-diagonal in the relevant Hilbert spaces describ- 
ing BjCjV, ... Alice and Bob will have to exchange data 
after each measurement to decide what to measure next 
if they are to extract all of the potentially accessible in- 
formation. This is an example of a situation where a 
number of back-and-forth exchanges equal to the num- 
ber of "nestings" is necessary to extract all of the work 
to access all of the locally accessible information. It 
is tempting to suggest that such nested density matrices 
could be used to restrict access to information by hiding 
it in some sufficiently deep layer (c. g., S), accessible 
only if two (or many) parties cooperate in its retrieval. 
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V. SUMMARY 
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Fig. 1 Information - tlicorctic measures of the relation- 
ship between A and S can be illustrated by means of 
the Venn diagram shown above. Shaded areas represent 
various imccrtaintics. Joint entropy H{S,A) is the mea- 
sure of uncertainty about the combined state of S and A. 
Individual circles correspond to the uncertainty about S 
and A. When their states arc correlated, the two circles 
overlapp. Mutual information I{S : A) is the area of that 
overlapp. Conditional entropy H{S\A) is one of the half- 
moons above the one left when the lens correspond- 
ing to the mutual information I{S : A) is subtracted 
from H{S). Obviously, H{S,A) = H{A) + H{S\A) = 
H{S) + H{A\S) = H{S) + H{A)-IiS : A). These equal- 
ities are predicated on the classical assumption that the 
states of 5 and A exist objectively, and, thus, a measure- 
ment need not disturb them. In quantum theory this 
is not the case: a measurement will in general redefine 
the state of the measured object, even for a "outsider" 
who does not know its outcome. Indeed, for a generic 
quantum state of the pair SA a measurement of A alone 
would increase the uncertainty of the outsider - would 
increase the entropy he attributes to the pair. This is a 
consequence of the difference between the nature of joint 
states in classical physics (where they are represented by 
Cartesian products of subspaces of the constituents) and 
quantum physics (where they exist in a tensor product 
of the two Hilbert spaces). It has profound effects on the 
accessibility of the information and leads to a difference 
in the efficiency of Maxwell's demons. 



Fig. 2 A simple model of information processing built 
of the controlled not (c-not) logical gates illustrates the 
origin of the difference between the efficiencies of the clas- 
sical and quantum versions of Maxwell's demon. The aim 
of this demon (whose memory is marked by V) is to use 
the correlation between S and A to purify their individ- 
ual states, so that they can be used individually as fuel. 
To this end, the demon; (a) finds out the state of the 
apparatus A; (b) uses this information about A and the 
"promise" of the correlation - prior knowledge about the 
form of PSA embodied in the logical circuit above - to 
decrease entropy of <S; so that = ks^Til. — H{S\A)) 
work can be extracted from S, (c) resets his own mem- 
ory to the initial ready - to - measure state, so that the 
same sequence of actions can be carried out cyclically on 
a whole ensemble of identical SA pairs. Thus, a quan- 
tum demon operating on an initially entangled state will 
result in an evolution: 

{a\OsOA) + l3\lsU))\0v) = 

a\OsOT>OA) + 0\1sOv1a) ^" a|0s02,0^) + /3|l5lpU) 

a\OsO-DOA)+P\OslvlA) a\OsOvOA) + P\OsO-DlA} 
= |05)|0p)(a|0^)+/3|U)) 

disentangling S from A. However, a demon whose mem- 
ory decoheres - e.g., entangles with the environment - 
will not be able to take advantage of the quantum cor- 
relations in the state of SA pair. Decoherence lead- 
ing to the einselection of the basis {|0x)), \ lv)} in the 
memory of the demon can be represented by another 
c-not that acts between V influencing the state of the 
environment £ (not shown in the figure). As a con- 
sequence, following the c-not (a) interaction with the 
environment leads to (alOsOuO^) -|- P\Os^v'i-A))\^o) 
Q:|050x)0^)|iro)+/3|05li5l^)|ei). Thus, when all the other 
cnot's are carried out; |0s)|02?)(a|0^)|£o) -|- P\1a)\£i)) 
obtains, leading to the same pure states of the system 
and the demon, but (in effect) a mixed state of the ap- 
paratus, 

PA = Tr£{a\OA)\eo)+P\lA)\ei)){a{OA\{eo\+P{lA\{ei\) 
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providing that (eo|ei) = 0. In this case, decoherence 
which turns demons performance from quantum to clas- 
sical makes it impossible to extract all of the thermody- 
namic benefit from quantum correlations. We leave it as 
an exercise for the reader to show that the classical corre- 
lation between S and A (Eq. (9b)) leads to the same final 
state, thus proving that a classical demon can extract all 
of the work present in a classical correlation. 
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Fig. 3 Discord S{z), Eq. (17), and the lower bound on 
the work deficit A(z) for Werner states, psA = "'r^l + 

z\iJSA){M, where ^jsa) = {\OsOa) + \1s1a})/V2. In 
this simple case both discord (which is equal to the work 
deficit) and the lower bound on the work deficit derived 
in Ref. 24 are independent of the basis and the same 
for both "ends" of the correlated pair. As argued here, 
there are cases where discord will actually play a role of 
an upper bound on the work deficit, as it is derived under 
the assumption of one-way classical communication. 
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